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MELANIE - VIDA, Visual Documents Analysis 

(Analysis and Visualisation of Texts) 



VIDA - Visual Document Analysis 

• Interactive user interface for visual representation of documents and analysis 
results 

• Analysis of documents against defined criteria e. g. 

- number of words in the text 

- entities such as places, names, organizations 

• Statistical analysis of texts 

- keywords 

- n-grams 

- word associations 

• Automatic national language identification for texts in 

- German 

- English 

- French 

- Italian 

• Text abstraction for speed reading and faster scanning of information 

• Analysis results displayed in tables and graphs 

- word frequencies 

- associations 

• Text tagging, where texts are highlighted according to relevant criteria 

• Relations between keywords 

• Finding similar texts 



Finding important text entities 

Words in the various meaning categories are 
found automatically. Different highlighting is 
used in the text for each of the meaning 
categories. 



Wberl Ernstem was bom ai Ur n, infAirttumt • u.g naiiy. on 
March 14, 1079 Sin weeks laler the family moved to Muntcii 
and he began hi* schooling Ihere tit Ihe I Ltitpald Gymnasium 
Later, they moved t* h.iV and Aibeh continued his 
education at Au; ou, and In 1 096 he entered Ihe 

Swiss Federal Polytechnic School in Zu It to be trained as 
a teacher in physics and mathematics In 1 901 , ihe year he 
gamed his diploma, he acquired Swiss citizenship and, as he 
was unable So find a teaching post, ha accepted a position 
as technical assistant rn the Swiss Patent Office In 1 90S 
he obtained his doctors degree. 



During Ms slay al the patent Office , and in his spare time, 

he produced mut h or his remarkable work arid In 1 908 ha was 

appointed PrivatdozerHin Qmrntt In 1 909 he became Professor 

Etfra ordinary at Zurich, in 1 911 Professor of Theoretical 

Physics an ‘ i returning to i in the following year 

to dll a similar post In 1914 he was appointed Director of 

the Kaiser Wilhelm Physical Institute und 

Professor in the Untwrsitv of He became a Oeiman 

c ibsen in 191 i end re mained in Berlin until 193 3 when he renounced 

Ms citizenship tor political reasons and emigrated lo Amukca 

to lake the position of Professor of Theoretical Physics al 

Pitncel w* HebecamosUndeitSMtr s triton In 1 940 end 

reti red from his pesl in 1 9*5 

After World War II, Einstein was a leading flguie in the 
World Government Movement, he was offered Ihe Presidency of 
the Slate of tor. mi, which he declined, and he collaborated 
with Dr Chaim Weiimann in establishing Ihe 

Hefaietif university of Jpi union 
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Metadata 



General information on the analysed 
document is compiled as metadata. 
This metadata includes the following 
information, for example: 

• Document name 

• Source directory 

• Creation date 

• Number of words, lines and 
characters 

• National language 



Mel add! 9 | | | Word lists | Ngtams| Tag Intel bog| 



Attribute 


Value 


ENV.dale 


ID. Mai 2005 


EMV OS 


Windows XP 6.1 


USER, name 


uueb 


USER, language 


de 


USER, region 


USER osetOir 


JCiYFOPS 


NUM CHARS 


: "'15357 ” 


NUM LINES 


109 


N'UM SENTENCES 


w 


FILE, name 


aiiwleinl .txt 


FILE.byleLength 


5357 


FlLE-direclary 


q :\TO PSMDnc umenl s 


FILE, dale 


Do, 0i Januar 1970 


FILE, encoding 


C f ' ".52 


LANGUAGE 


Engliseh - Versimgls Staalen von Amerika 


LANGUAGE RESOURCES 


cATOPS\LsnquaqiR« 80 urces\an US 


NUM WORD TYPES 


3S6 


NUM WORD TOKENS 


810 


NUM CONTE WTWQRD TYPES 


jaes 


NUM COMTE MTWDRD TOKENS 


4D2 


AV WORD LENGTH 


5 1O37OG7O07QG704 


NUM TEXT LINES 


96 


NUM PARAS 


12 



Word statistics 



A set of word statistics provides 
information on the most frequently 
used words 

and the most frequently used words 
relevant to the content i.e. 
keywords. 



Word Frequencies Content Word Frequencies 



Rank j 


Word 


Fraque. .. | 


Rank | Word 


Frequency | 


lilhe 


59:* | 


1 theory 


131 


2|oT 


5§ri 


2 einstein 


B[ 


3in 




Swoik 


7] 


land 


36? 


4 physics 




She 


34] 


5 relativity 


s; 


Shis 


251 


6?mechanics 


si 


?W 


20 


7 albert 


A 


fl|a 


M 


8 'professor 


4\ 


9 


Iheory 


“3a 


9 problems 


4| 


10 


W “ 


rd 


10 time 


3| 


11 


at 


xm 


1 ' posl 


3| 


12 


wilh 


a 




12 america 


3 


13 


einslein 


s 




13 important 


A 


“ U 


qn 


7| 




14 world 


3] 


15 


work 


7 




15 special 


1 


16 


physics 


7 




16 berlin 


~^~5l 


17 


relatNiiy 


6 


17 nob el 


3| 


13 


as 


9 




10 statistical 


I 


10 


this 


5! 




ISiyesr 


31 


20 


mechanics 


S 




20 movement 


3\ 


21 


ter 


A 




21 continued 


1 


22 


later 


4 




22 i(i Itanium 


% 


23 


from 


4 




23'printelon 


3l 


24 


albert 


4 




24 institute 


A 


25 


professor 


4 




2fi : swiss 


i 


26 


an 


4 




26 2 urich 


si 


27 


problems 


4 


il 


27 war 


A 













Keywords - Word n-grams 



Statistical determination and listing 
of all keywords and word n-grams 
(common sequences of words). 



Keywords 


Word ngrams 


Iheory 


theory of relativity 


einstein 


special theory 


work — 


albert einstein 


physics 


statistical mechanics 


relativity 


quantum theory 


mechanics 


general theory 


albert , 


princeton new jersey , 


T 

nmfoC'ftnr <— J 


T 

pnnli™ iaH fn uurirk t—J 
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National language identification 



Automatic determination of the language of 
a text. National language identification is 
currently implemented for the following 
languages: 

• German 

• English 

• French 

• Italian 

Other languages can be implemented on 
request. 




Abstraction 



A summary can be generated 
automatically to enable speed-reading 
of the text. 



Summary 

(S I) Albert Einstcen wis bom at Utn. in WurttembetE, Germany, on March 14. 1879 

(S 2) Set weeks kner the fasnfly moved to Munich and he besam his sCtmwIinE there at Hie Lwtpflld 
Gymnasium 

(3 ¥i Later, they moved to Italy and Albert c entinvcd his cducahan at Aaniu. Gwlzerlanrl and in 1 89d 
be entered the Swiss Federal Polytechnic School in Smcb to be trained as a teacher in phasic s and 
maBurnauie s 



(£ 4) In 1 90 L , the year he pined has diploma, he acquired £vn: s einsenihip and, as he was unable to 
find a teaching past, he accepted a penlwn as technical assistant in the Swim P-alenl Offic e 

(S vj b 1005 he obtained his doctor's degree 

<3 0) OwmK hu slay at the Patent Office, and un his spare lime, he prudireed much uf his remashable 
work and in 1 90S be was appeented PruraldcEertl in Berne 

(t 7) b 1909 he became Professed IbdraoirttaHry aJE Zurich. in 1 93 1 Professor of Theoretical Physics 
al Prague, returning to Zurich m the fallowing year La fill a similar past. 

(S 8) In 1 93d be was appealed Director of the Kaiser Wilhdm Physical Institute aid Professor in the 
Llmwensity of Berlin 

(£ yj He became a German ctt&en m L5 14 and renamed in Berlin until 1 933 when he renounced his 
crimasship far pntibcal imsuris and emijrniLtd io Ammca La lake the pasiban of ProfessEr of 
Theoretical Physics at Princeton* 

<S 1 0> E le became a United States cS&m in 9 940 and retired from has past in 1 SM5. 



Visualisation - Frequency 



Visual representation of the frequency of 
specified keywords in the text. 
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Visualisation - Associations 

VIDA includes the visualisation option of 
an association graph for delivering faster 
identification of the text content. 




Visualisation - Entities 



VIDA provides coloured 
highlighting of entities and also 
lists entities and keywords in a 
table. 




Finding similar texts 



VIDA provides a function that 
assesses the similarity of texts, 
for text location. This function 
automatically lists in a table all the 
texts analysed, in order of 
decreasing 

similarity. In addition, the most 
important matches are output in 
keywords. 



1.0 






einsttm_de 1 


einstein em stem if Kit phystk relativitatstheorie tkone 
albert spider 


0.75 


«instem_df 3 


ernstem tin stems ait physic relativitraiftearie albert 


0.5 


einEteLnjle.2 


einjtein ait relativitatstbeone albert 


025 


eiutea_en-2 


ernstem albert 


0125 


euisttLfl till 


Einstein 


0125 


finite in cn 3 


emit; in 


0. 125 


eoftea en4 


eirutein 


0 125 




eiastem 


0 


nn aLgaida 




0 


tn bush 
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VIDA - Specification 

Operating systems Windows, Linux 



Automatic national language identification • German 

• English 

• French 

• Italian 

• Other languages on request 

Text analysis • English 

• German 

• Other languages on request 



Statistical analysis of texts 



Based on word lists and rules 



No linguistic knowledge necessary 

Functions • Word statistics 

• Automatic determination of 
keywords and n-grams 

• Word associations 

• Histogram for word 
frequencies 

• Marking (tagging) of entities: 
proper names, places, date, etc. 

• Visualisation of associations 
and entities 



Ordering information 

VIDA-Inter Vida tool, to be used interactively, incl. graphical user interface 
VIDA-Prod Vida program to be embedded in software environment 
VIDA-Conf Vida configuration software, used for adaptation to new domain 
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Company Principles and Policy 

Technology 

...in development and company management is state-of-the-art, 
and represents only the best. 

Quality 

...in all areas of our company is regarded as the almost requirement 
for risk-free and successful cooperation with our customers, 
and business partners. 

Market Position 

...we are the specialists in the field of signal and data processing 
as well as pattern recognition, and we are glad to face competition. 

Colleagues 

... form the roots of our company, and give the performance required 
for maintaining and building the technical base, and close 
personal cooperation we have with our clientele. 

Growth 

... we strive toward a healthy, stable foundation at home and 
abroad. 

Services 

... are comprehensive and complete. As a full-system company 
we offer standard equipment, systems, and services. 

Trust 

...in the relationships to our business partners, and within our 
own company forms the basis of our business. 
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If you would like further Information about ELAMAN, 
or would like to discuss a specific requirement or project, please contact us at: 

Elaman GmbH 
German Security Solutions 
Seitzstr. 23 
80538 Munich 
Germany 

Tel: +49-89-24 20 91 80 
Fax: +49-89-24 20 91 81 
info@elaman.de 
www.elaman.de 






